Mint: MDL-based approach for Mining INTeresting Numerical Pattern Sets
نویسندگان
چکیده
Abstract Pattern mining is well established in data research, especially for binary datasets. Surprisingly, there much less work about numerical pattern and this research area remains under-explored. In paper we propose Mint , an efficient MDL-based algorithm The MDL principle a robust reliable framework widely used mining, as subgroup discovery. reuse discovering useful patterns returning set of non-redundant overlapping with well-defined boundaries covering meaningful groups objects. not alone the category miners based on MDL. experiments presented show that outperforms competitors among which IPD, RealKrimp Slim .
منابع مشابه
Mining Interesting Contrast Sets
Contrast set mining has been developed as a data mining task which aims at discerning differences across groups. These groups can be patients, organizations, molecules, and even time-lines. A valid contrast set is a conjunction of attribute-value pairs that differ significantly in their distribution across groups. The search for valid contrast sets can produce a prohibitively large number of re...
متن کاملTractable Interesting Pattern Mining in Large Networks
Pattern mining is an important data mining task. While the simplest setting, itemset mining, has been thoroughly studied real-world data is getting increasingly complex and network structured, e.g. in the context of social networks, economic networks, traffic networks, administrative networks, chemical interaction networks and biological regulatory networks. This presentation will first provide...
متن کاملA Causal Approach for Mining Interesting Anomalies
We propose a novel approach which combines the use of Bayesian network and probabilistic association rules to discover and explain anomalies in data. The Bayesian network allows us to organize information in order to capture both correlation and causality in the feature space, while the probabilistic association rules have a structure similar to association mining rules. In particular, we focus...
متن کاملNDPMine: Efficiently Mining Discriminative Numerical Features for Pattern-Based Classification
Pattern-based classification has demonstrated its power in recent studies, but because the cost of mining discriminative patterns as features in classification is very expensive, several efficient algorithms have been proposed to rectify this problem, such as DDPMine [3], corrmine [14], and gPLS [17]. DDPMine obtains its efficiency by iteratively pruning the input dataset thereby potentially lo...
متن کاملPreserving Privacy for Interesting Location Pattern Mining from Trajectory Data
One main concern for individuals participating in the data collection of personal location history records (i.e., trajectories) is the disclosure of their location and related information when a user queries for statistical or pattern mining results such as frequent locations derived from these records. In this paper, we investigate how one can achieve the privacy goal that the inclusion of his...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data Mining and Knowledge Discovery
سال: 2021
ISSN: ['1573-756X', '1384-5810']
DOI: https://doi.org/10.1007/s10618-021-00799-9